Dialogue Context for Visual Feedback Recognition
Abstract
Head pose and gesture offer several key conversational grounding cues and are used extensively in face-to-face interaction among people. When recognizing visual feedback, people use more than their visual perception. Knowledge about the current topic and expectations from previous utterances help guide our visual perception in recognizing nonverbal cues. In this chapter, we investigate how dialogue context from an embodied conversational agent (ECA) can improve visual recognition of user gestures. We present a recognition framework that (1) extracts contextual features from an ECA’s dialogue manager, (2) computes a prediction of head nods and head shakes, and (3) integrates the contextual predictions with the visual observations of a vision-based head gesture recognizer. We found a subset of lexical, prosodic, timing and gesture features that are easily available in most ECA architectures and can be used to learn how to predict user feedback. Using a discriminative approach to contextual prediction and multimodal integration, we were able to improve the performance of head gesture detection even when the topic of the test set was significantly different from that of the training set.
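The three-step framework described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the feature names (`is_question`, `seconds_since_utterance_end`, `agent_is_nodding`), the logistic form of both stages, and every weight and threshold are assumptions chosen only for illustration. In the work described, the contextual predictor and the integration step are learned from data.

```python
import math
from dataclasses import dataclass


@dataclass
class ContextFeatures:
    """Hypothetical contextual features read from a dialogue manager."""
    is_question: bool                   # lexical cue: the agent just asked a question
    seconds_since_utterance_end: float  # timing cue
    agent_is_nodding: bool              # gesture cue from the agent itself


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def contextual_prediction(f: ContextFeatures) -> float:
    """Step 2: likelihood of a head nod from context alone.

    The hand-set weights below stand in for a trained discriminative model.
    """
    z = (-2.0
         + 2.5 * f.is_question
         + 1.0 * f.agent_is_nodding
         - 0.8 * f.seconds_since_utterance_end)
    return sigmoid(z)


def fuse(vision_score: float, context_score: float,
         w_vision: float = 3.0, w_context: float = 2.0,
         bias: float = -2.0) -> float:
    """Step 3: combine the vision-based score with the contextual prediction.

    A logistic late-fusion rule with assumed weights, not the paper's integrator.
    """
    return sigmoid(w_vision * vision_score + w_context * context_score + bias)


def detect_head_nod(vision_score: float, features: ContextFeatures,
                    threshold: float = 0.5) -> bool:
    """Run steps 2 and 3 and threshold the fused score."""
    return fuse(vision_score, contextual_prediction(features)) >= threshold


# Step 1 is represented here by filling in the features by hand.
after_question = ContextFeatures(True, 0.2, False)
mid_silence = ContextFeatures(False, 3.0, False)
```

With these assumed weights, an ambiguous vision score of 0.4 is accepted as a nod right after the agent asks a question (`detect_head_nod(0.4, after_question)`) and rejected during a long silence (`detect_head_nod(0.4, mid_silence)`), while a strong vision score such as 0.9 is still accepted without supporting context.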
Similar resources
Towards Context-Based Visual Feedback Recognition for Embodied Agents
Head pose and gesture offer several key conversational grounding cues and are used extensively in face-to-face interaction among people. We investigate how contextual information can improve visual recognition of feedback gestures during interactions with embodied conversational agents. We present a visual recognition model that integrates cues from the spoken dialogue of an embodied agent with...
Context-based visual feedback recognition
During face-to-face conversation, people use visual feedback (e.g., head and eye gestures) to communicate relevant information and to synchronize rhythm between participants. When recognizing visual feedback, people often rely on more than their visual perception. For instance, knowledge about the current topic and expectations from previous utterances help guide the recognition of nonverbal cues. The goal o...
Context and Dialogue Control
Dialogues are usually motivated by some underlying, noncommunicative goal. Participants in a dialogue therefore perform two tasks at once: that of trying to achieve the underlying noncommunicative goal, and that of communicating in order to achieve the associated communicative goal. This is reflected in the fact that dialogues consist not only of elements motivated by the underlying task, but al...
Integration of Visual Perception in Dialogue Understanding for Virtual Humans in Multi-Party interaction
While the dialogue functions of speech in two-party dialogue have been extensively studied, there has been much less work on multi-party communication, multimodal communication, and especially vision in a multi-party face-to-face setting. In this paper we report on one such effort to apply state-of-the-art real-time visual processing to enhance a dialogue model of multi-party communicati...
The Role of Context in Affective Behavior Understanding
Face-to-face communication is highly interactive. Even when only one person speaks at a time, other participants exchange information continuously amongst themselves and with the speaker through gesture, gaze, posture and facial expressions. Such affective feedback is an essential and predictable aspect of natural conversation and its absence can significantly disrupt participants' ability to ...